PhD Depth Examination Report Algebraic Foundation of Statistical Parsing Semiring Parsing

نویسندگان

  • Yudong Liu
  • Anoop Sarkar
چکیده

Statistical parsing algorithms are useful in structure predictions, ranging from NLP to biological sequence analysis. Currently, there are a variety of efficient parsing algorithms available for different grammar formalisms. Conventionally, different parsing descriptions are needed for different tasks; a fair amount of work is required to construct for each one. Semiring parsing is proposed to provide a generalized and modularized framework to unify all these different parsing algorithms into a general framework and by separation of the algebra and the algorithms, it makes the very same algorithm can perform across diverse tasks. One main concern about the semiring parsing system is the efficiency considerations. A packed representation for all possible target structures was discussed and different heuristic search strategies have been explored. By investigating more structured probabilistic models, we found that all the models are using the similar packed structures and apply the similar dynamic programming as classic inside-outside algorithms to the parameter estimation, which indicates that semiring parsing can be further extended to more complex models and can integrate more tasks, such as probabilistic learning. Semiring parsing turns out to have a solid theoretical foundation and has a promising perspective of applications.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

بررسی مقایسه‌ای تأثیر برچسب‌زنی مقولات دستوری بر تجزیه در پردازش خودکار زبان فارسی

In this paper, the role of Part-of-Speech (POS) tagging for parsing in automatic processing of the Persian language is studied. To this end, the impact of the quality of POS tagging as well as the impact of the quantity of information available in the POS tags on parsing are studied. To reach the goals, three parsing scenarios are proposed and compared. In the first scenario, the parser assigns...

متن کامل

An improved joint model: POS tagging and dependency parsing

Dependency parsing is a way of syntactic parsing and a natural language that automatically analyzes the dependency structure of sentences, and the input for each sentence creates a dependency graph. Part-Of-Speech (POS) tagging is a prerequisite for dependency parsing. Generally, dependency parsers do the POS tagging task along with dependency parsing in a pipeline mode. Unfortunately, in pipel...

متن کامل

Semiring Parsing

decorations of parse forests usingdynamic programming and algebraicpower series. Theoretical Computer Science.To appear.Tendeau, Frédéric. 1997b. An Earleyalgorithm for generic attribute augmentedgrammars and applications. In Proceedingsof the International Workshop on ParsingTechnologies 1997, pages 199–209.Viterbi, Andrew J. 1967. Error bounds forconvol...

متن کامل

Statistical Machine Translation by Generalized Parsing

Designers of statistical machine translation (SMT) systems have begun to employ tree-structured translation models. Systems involving tree-structured translation models tend to be complex. This article aims to reduce the conceptual complexity of such systems, in order to make them easier to design, implement, debug, use, study, understand, explain, modify, and improve. In service of this goal, ...

متن کامل

تأثیر ساخت‌واژه‌ها در تجزیه وابستگی زبان فارسی

Data-driven systems can be adapted to different languages and domains easily. Using this trend in dependency parsing was lead to introduce data-driven approaches. Existence of appreciate corpora that contain sentences and theirs associated dependency trees are the only pre-requirement in data-driven approaches. Despite obtaining high accurate results for dependency parsing task in English langu...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2004